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Abstract. We analyze the connection between the autocorrelation of a bi- 
nary sequence and its run structure given by the run length encoding. We show 
that both the periodic and the aperiodic autocorrelation of a binary sequence 
can be formulated in terms of the run structure. The run structure is given 
by the consecutive runs of the sequence. Let C = (Co, Ci, ■ • • , C n ) denote the 
autocorrelation vector of a binary sequence and A the difference operator. We 
prove that the fcth component of A 2 (C) can be directly calculated by using 
the consecutive runs of total length k. In particular this shows that the fcth 
autocorrelation is already determined by all consecutive runs of total length 
I < k. In the aperiodic case we show how the run vector R can be efficiently 
calculated and give a characterization of skew-symmetric sequences in terms 
of their run length encoding. 



1. Introduction 

Let n be a positive integer and and let a = (ax, a 2 , ■ ■ • , a n ) be a (finite) sequence 
of real numbers. The length n of the sequence a will be denoted by \a\. a is called 
a binary sequence if <2j G { — 1,1} for all i = 1, • • • , n. In the following we analyze 
in detail the connection between the autocorrelation of a binary sequence and its 
run structure given by the run length encoding. Binary sequences with suitable 
autocorrelation properties play an important part in a wide range of different en- 
gineering applications. For example they are used in signal processing in order to 
detect signals in a noisy background. The autocorrelation measures the similarity 
between the original sequence and its translate. In many applications it is of in- 
terest to collectively minimize the absolute values of the off-peak autocorrelations. 
For a survey on this topic we refer to [2 [21 [3]. Depending on the type of applica- 
tion there are two types of autocorrelations commonly used: the aperiodic and the 
periodic autocorrelation. 

For k = 0, 1, • • • ,n— 1 the fcth aperiodic autocorrelation is given by 

n — k 

(1.1) C fc (a) := ^2 aia i+k . 

i=i 

In the periodic case put a n+i :~ ai for i > 1; for k = 0, 1, • • • , n— 1 the fcth periodic 
autocorrelation is then defined by 
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(1.2) C k {a) :=y^aia i+k . 

»=i 

In the following we will additionally put C n (a) ■= and C n (a) := n. Note that 
for a binary sequence the peak autocorrelation equals the length of the sequence: 
Co (a) = Co(a) = n. Note further that for k — 1, ■ ■ ■ ,n we have C k {a) = C n - k {a) 
and 

(1.3) C k (a)=C k (a) + C n - k (a). 

In the following we analyze in detail the connection between the autocorrela- 
tions of a binary sequence and its run structure. A run is defined as a substring of 
maximal length where all elements have the same value. Runs as well as autocorre- 
lation values were used in p|] in order to measure apparent randomness in a binary 
sequence (according to jS] these randomness postulates first appeared in [5]) . 

We prove that both the periodic and the aperiodic autocorrelation of a binary 
sequence can be formulated in terms of the run structure. As we will see the 
run structure is given by the consecutive runs of a. Consecutive runs of a with 
total length k determine R k , the fcth element of R; we call R the run vector. If 
C := (Co (a), Ci(a), ■ ■ • , C n (a)) denotes the autocorrelation vector of a and A the 
forward difference operator, then we will prove that A 2 (C) = — 2R. In particular 
this shows that the consecutive runs of total length up to k — 1 determine the kth 
autocorrelation. 

For the periodic case this was established in [TJ. The objective of this paper is 
twofold: to simplify the proof in [7j and to find a similar relationship for the ape- 
riodic autocorrelation. We start with the latter; in the first part of this paper the 
aperiodic case is considered and it is shown how the aperiodic autocorrelation of a 
binary sequence can be formulated in terms of the run structure. In the aperiodic 
case we show how the run vector can be efficiently calculated based on a simple 
algorithm and we derive a further practical formula for calculating the elements 
of the run vector. Furthermore, we give a characterization of skew-symmetric se- 
quences in terms of their run length encoding. Finally, we consider the periodic 
case. The presented proof of the aperiodic case can also be applied to the periodic 
case with minor modifications, resulting in a new simplified and more direct proof. 
This paper, however, follows a different and even shorter approach: the results for 
the periodic case are directly derived from the aperiodic case by using (| 1 . 3[) . 

After the completion of this paper the author became aware that a different run 
correlation technique for the aperiodic case was published in [5]. There a tabular 
arrangement for the correlation calculation was developed by using a set of sequen- 
tial relations which relate the aperiodic autocorrelation to the run structure. This 
is essentially the result of Theorem Q] which we prove in the first part of this paper. 
In [9] these results were used for a search strategy in order to construct binary 
sequences with specified aperiodic autocorrelation values by explicitly eliminating 
large subsets of binary sequences and thus reducing the search space. 
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2. Preliminaries 

In the following a will always be a fixed binary sequence of length n. Further- 
more, in the aperiodic case we will always put a := and a n+ i := in order to 
circumvent boundary problems. 

A substring of a always represents a non-empty contiguous part (cij, a, + i, • • • , a^-i) 
of a with l<z<j<n+l; quite similar to the concept of an half-open interval it 
will be denoted by a(i,j). In the following we will also always distinguish between 
the substrings a(i,j) and a(i',j') whenever ^ Hence a has n '( n + 1 '> 

different substrings. Note that by our definition a substring is always non-empty 
and that the length |a(i, j)| of the substring a(i,j) is given by |a(i,j)| = j — i. If 
1 < i and j < n + 1, then a(i,j) is called an inner substring, otherwise a(i,j) is 
called an outer substring. 



2.1. Runs, Run Blocks and Run Length Encoding. A run of a is a substring 
of a with maximal length where all elements have the same value. Thus if a(i,j) 
is a run of a, then a,_i ^ <Xj = a. i+ i = ■ ■ ■ = aj_i ^ aj. In the following 7 will 
always denote the total number of runs of the sequence a. Consecutive runs form 
what we will call a run block. Thus for 1 < i < j < n + 1 a substring a(i,j) is a 
run block of a, if and only if a^x ^ di and aj_i 7^ aj. Let for example a be the 

binary sequence of length 13 given by a = (+ + + + + + H h + +); here 

and in the following the symbol '+' stands for 1 and the symbol '— ' for -1. In this 
case we have 7 = 3 since a has three runs, namely a(l,8), a(8, 11) and a(ll, 14). 
Furthermore, a has a total of six run blocks: there are the three run blocks a(l, 8), 
a(8, 11) and o(ll, 14) consisting of just a single run, there are the two run blocks 
a(l, 11) and a(8, 14) consisting of two consecutive runs and there is the run block 
a(l, 14) which is a itself consisting of three consecutive runs. 

For an inner run block a(i,j) (i.e. a run block which is an inner substring such as, 
for example a(8, 11) of the previous example) we have = — a, and = —aj. 
Note since a = a„ +1 = the sequence a itself is represented by the run block 
a(l,n + 1) and that each run block of a can be uniquely divided in runs of a. 
In particular, the sequence a can be uniquely divided in 7 runs a(ik,ik+i) with 
1 = ii < 12 < ■ ■ ■ < i-y+i = n + 1; a{ik,ik+i) is then called the kth run of a. If 
we put ru := |a(ik,ik+i)| = ik+i - ik, then the sequence r = (ri,r 2 ,--- ,r 7 ) is 
called the run length encoding of a. Note that beside a only the binary sequence 
(— 01, — <i2, • • • , — a n ) has the same run length encoding as a. 

Let for example a be the binary sequence of length 13 given by a = (+ + + + + + 

), then 7 = 2 and the run length encoding r of a is given by r = (6, 7). 

In this case a has two runs and three blocks. If a = (+ + + + + + H then 

the run length encoding of a is given by r = (7, 3, 3) and as already noted a has three 

runs and six run blocks. The binary sequence of a = (+ + H h + H ) 

has four runs and its run length encoding is given by r = (3, 6, 3, 3). 

As we will see later the weight of a run block will be used in order to calculate 
the autocorrelation. The weight w of a substring a(i,j) is defined by 

{2cii • Oj_i if a(i,j) is an inner run block 
ai ■ aj-i if a(i, j) is an outer run block 
otherwise. 
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Note that for a substring b we have w(b) = unless & is a run block. In this 
case \w(b)\ = 2 if b is an inner and \w(b)\ = 1 if b is an outer run block. We 
remark further, that if a run block b — a(i,j) consists of m consecutive runs, then 
dj • cij-i = —(—1)™ and thus w(b) > if m is odd and w(b) < if m is even. 

2.2. The Aperiodic Run Structure. In the following r will always denote the 
run length encoding of a with r& = ik+i — *fc and 7 as defined above. Let us define 
the aperiodic run structure TZ of a as the set of all substrings of r. 

Next we want to show that there is a one-to-one correspondence between the 
run blocks of a and the substrings of the run length encoding r. If 6 is a run block 
of a of length fc, then we have b = a(i p , i q ) for some l<p<q<7+l and the run 
block b = a(i pi i q ) corresponds to the substring r(p,q). The run length encoding 
gives us therefore a mapping $ from the set B of all run blocks of a to the aperiodic 
run structure TZ of a defined by $(a(i p , i q )) — r(p, q). Note that <E> is bijective and 
that $ maps an inner run block of a to an inner substring of r. Furthermore for 
k = 1, • • • , n let Bk denote the set of all run blocks of a with length k and let 
TZk denote the set of all substrings r(p, q) of r whose sum Y^= P r o 1S equal to k; 
obviously Bk C B and TZk C 7?.. If 6 = a(i p ,i q ) is a run block of a with length k, 
then we have 

9-1 

(2.2) j g - j p = yVj = fc 

and hence 

(2.3) <S>{B k )=llk, 

Thus each element u £ TZk (i>e. each substring u of r whose sum is equal to k) 
corresponds uniquely to a run block of length k consisting of \u\ consecutive runs, 
and vice versa. 

For a substring u of r let 



(2.4) a(u) 



2 if u is an inner substring of r 
1 otherwise. 



If the run block a(i p , i q ) consists of m consecutive runs, then a,i p -Oj _i = — (— l)" 1 
as already noted. Since ^>(a(i p ,i q )) = r{p,q) we have m — \r(p,q)\ and it follows 
from (|2.ip that w(a(i p ,i q )) = —a(r(p,q)) ■ (— l)l r dwl. Since the mapping $ is 
bijective, (|2.3I1 together with the last remark of the previous subsection gives us 



that 

(2.5) ]T a(u) ■ (-l)l«l =-J2 ««(&)■ 

3. The Main Result for the Aperiodic Case 

In this section we want to analyze the connection between the aperiodic au- 
tocorrelations Cfc(a) of the binary sequence a and its run structure given by the 
run length encoding r = (ri,^,-- - ,r 7 ). Note that r j = \a\ = n = Cq{o). 

Moreover, we have 

(3.1) Ci(a) = ra+l-2 7 
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since for each j — 1,2, • ■ • ,j — 1 the jth run of a contributes rj — 2 to the sum 
Ci(a) whereas the last run contributes r 7 — 1 to the sum Ci(a). Hence we have 

Ci{a) = Ej=x fa - 2) + (r 7 - 1) = 1 + Ej=ifa - 2) = 1 + n - 2 7 . 
For k = 1, 2, • • • , n — 1 put 

(3.2) := ^ a («) • 

and R(a) :— (Ri, R2, ■ • • , i?„-i); we call i?(a) the run vector of a. Thus by (|2.5p 
we have 

(3.3) i? fc - - ^ w(6). 

Therefore, in order to calculate R k all consecutive runs of a with a total length of 
k (i.e. all run blocks of a with length k) have to be considered. Each run block b of 
length k contributes the (negative) weight —w(b) as defined in (|2.ip to the sum in 
(|3.3p . As already noted |u>(6)| equals 1 if & is an outer run block, \w(b)\ equals 2 if 
b is an inner run block and the sign of w(b) depends only on the number of runs: 
we have w(b) > if the run block b consists of an odd number of consecutive runs 
and w(b) < if b consists of an even number of consecutive runs. 

Let us for example compute the run vector i?(a) for the three sequences of the 
previous example. If r = (6,7), then apart from a itself there are only two run 
blocks; they both consist of a single run, their length is 6 resp. 7 and both are outer 
run blocks. Thus we have K k = for k ^ 6, 7 and K 6 = {r(l, 2)}, K 7 = {r(2, 3)}; 
hence by $2$ R(a) = (0, 0, 0, 0, 0, -1, -1, 0, 0, 0, 0, 0). 

For r = (7, 3, 3) there are three run blocks consisting of a single run; their length 
is 7, 3 and 3, their weight -1, -2 and -1 and they correspond to the substrings r{l, 2), 
r(2,3) and r(3,4). In addition there are two (outer) run blocks consisting of two 
consecutive runs. Their length is 10 resp. 6, both have weight 1 and they correspond 
to the substrings r(l,3) and r(2,4). Altogether, we have IZ3 — {r(2, 3), r(3, 4)}, 
TZq = {r(2, 4)}, TZy = {r(l, 2)}, H w — {r(l, 3)} and TL% = for the remaining cases 
k = 1,2, 4, 5, 8, 9, 11, 12; this shows that R(a) = (0, 0, -3, 0, 0, 1, -1, 0, 0, 1, 0, 0). 

Finally, we consider the example r = (3, 6, 3, 3). In order to calculate for instance 
Rq we have to consider all consecutive runs of a which have a total length (i.e. run 
block length) of 6. There are exactly two run blocks of a with length 6. The first one 
consists of a single run, has weight -2 and corresponds to the inner substrings r(2, 3); 
the second one consists of two consecutive runs, has weight 1 and corresponds to 
the outer substring r(3,5). This shows that 71$ = {r(2, 3), r(3, 5)} and Rq = — 1. 
Similarly, we have TZ 3 = {r(l, 2), r(3, 4), r(4, 5)}, K 9 = {r(l, 3), r(2, 4)}, TZ U = 
{r(l, 4), r(2, 5)} and IZk — if k is not a multiple of 3. Hence it follows that 
R(a) = (0, 0, -4, 0, 0, -1, 0, 0, 3, 0, 0, -2, 0, 0). 

The next theorem (cf. [8]) is the main result for the aperiodic case. It shows 
that for a binary sequence a the aperiodic autocorrelations and and the run vector 
R(a) are closely related. 

Theorem 1. Let k = 1, ■ ■ ■ , n — 1; then 

C k+1 (a) - 2C k (a) + C fc _i(a) = -2R k . 

Proof. Let 5 := (Si, 62, ■ • • , be the sequence defined by Si := a, — a^i for all 

i = 1, 2, • • • ,n + 1. Since a(i,j) is a run block of a if and only if dj_i ^ ai and 
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cij-i i= a,j it follows that 

(3.4) is a block of a 4=» Si ■ Sj ^ 

Now let 1 < i < j < n + 1 with j — i < n. If a(i,j) is a run block, then 
did,- + di-ia,-! = —aiCij-i — ai_ia,; with (13.41) this shows that 



Hence by 



<5j<5j = (ai - a i ^ 1 )(a J - ay-i) 

= CLiCLj — CLiCLj — l — CLi^lCLj + tt^—lttj—l 

= -2(a i a J _ 1 + ai-idj) 
= -2w(a(i,j)) 

2R k = 2 a ( u ) ■ = - 2 w{b) 

u£ll k b£B k 
n+l-fc 

= -2 V w(a(i, i + k)) 

i=l 
n+l-k 

= 5i8 i+k = C k (6). 

i=l 

Now Theorem [1] follows directly from the following lemma. □ 

Lemma 2. Let S = (S\, 62, ■ • • , ^n+l) the sequence defined by Si :— ai — a^-i for 
all i = 1, 2, ■ • • , n + 1; i/ien for fe = 1, 2, • • • , 71 — 1 

C fe+ i(a) - 2C fc (a) + C fc _i(o) = -C fe (5). 

Proof. Let 1 < fc < n — 1; then 

n+1 — fc n+1 — k 

= ^ <Wi+fc = ^ (ai — ai-l)(fli+fc — aj+fc-l) 
i=l i=l 
n+l-fc 

= J]] aiOi+fc — fli a i+fc-l — CLi-lCli+k + ttj-lOli+fc-l 

8=1 

=(Cfc(a) + a„ + i_ fc a n+ i) - C fe _i(a) 

- (a a fc+ i + C fc+ i(a) + an-fca^+i) 

+ (a a k + Cfe(a)) 
= -C fe+1 (a) + 2C fe (a)-C fc _ 1 (a). 

□ 

We have Co (a) = n and Ci(a) = n + 1 — 27 by (|3 . 1 ; Theorem [1] gives us that 
C 2 (a) = 2Ci (a) - n - 2Ri and thus C 2 (a) = n + 2-ij-2R 1 . Furthermore Theorem 
□ shows that C fc+ i(a) = 2C k (a) - C fe _i(a) - 2i? fe for fc = 1, 2, • • ■ , n - 1. 

Let us denote by C(a) := (Co(a), Ci(a), ■ ■ ■ ,C n (a)) the aperiodic autocorrela- 
tion vector of a. We can now use Theorem Q] in order to compute C(a) for the 
three sequences of the previous examples. As we have seen, if r = (6, 7) then we 
have R(a) = (0,0,0,0,0,-1,-1,0,0,0,0,0). It follows that C (a) = n = 13, 
Ci(a) = n + 1 - 2 7 = 13 + 1-2-2 = 10, C* 2 (a) = 2d(a) - n - 2R 1 = 
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20 - 13 = 7 , C 3 (a) = 2C 2 (a) - Ci(a) - 2i? 2 = 4 and so on, which gives us 
C(a) = (13, 10, 7, 4, 1, -2, -5, -6, -5, -4, -3, -2, -1,0). Furthermore, by a simple 
calculation we get C{a) = (13, 8, 3, -2, -1, 0, 1, 0, 1, 2, 3, 2, 1, 0) if r = (7, 3, 3) and 
C(a) = (15, 8, 1, -6, -5, -4, -3, 0, 3, 6, 3, 0, -3, -2, -1, 0) if r = (3, 6, 3, 3). 

Theorem Q] can be rephrased by using the difference operator. The (forward) 
difference operator A(6) of a sequence b = (b\, &2, • • • , b m ) with to > 2 is given by 
A (6) := (& 2 - 6i, 63 - &2, • ■ • , &m - &m-i)< For to > 3 we have A 2 (6) := A(A(6)) = 
(63 - 26 2 + &i, bi - 26 3 + 6 2 , • • • , b m - 2b m -i + & m - 2 )- 

Corollary 3. A 2 (C(a)) = -2R(a) for n > 3. 

Proof. This is just a reformulation of Theorem [TJ □ 

Remark. If we put a := (0, a±, 02, • • ■ , a n , 0), then Lemma [5] shows that for n > 3 

A 2 (C(a)) = -(Ci(A(a)),C 2 (A(a)),--- ) C n _i(A(a))). 

A more explicit relationship between the aperiodic autocorrelation and the run 
vector gives the next result. Note that by (|3.1| Ci(a) — Co (a) = 1 — 2j. 

Corollary 4. Let k = 0, 1, • ■ ■ , n; i/ien 

fc-i 

Cfc(a) =n+(l-2 7 )fe-2^(fe- 

i=i 

Proof. Since Co (a) = n and by (|3.1|) Ci(a) = n + 1 — 27 the statement is true 
for fe = 0, 1. For k > 2 it follows directly from A 2 (C(a)) = — 2R(a) and the next 
Lemma. □ 

Lemma 5. For m > 3 let b — (61 , 62, ■ ■ • , b m ) be a sequence of real numbers and 
let €j denote the jth component of A 2 (6) ; i.e. A 2 (6) = (ei, e%, ■ ■ ■ , e m _2)j then for 
k = 0, 1, • • ■ , to — 1 

fc-i 

bfe+i = 61 + (6a - &i)* + Y,( k ~ j) e i- 

Proof. This can be easily proved by induction on fc. □ 

Next we want to show that based on the symmetry of the operator A 2 a 
complementary formulation of Corollary U can be derived. If b and A 2 (b) = 
(ei, 62, ■ ■ ■ , e m -2) are defined as in Lemma[5j then we also have for k = 0, 1, • • • , m — 
1 

m-2 

(3.5) = b m + (6 m _i - 6 m )(m — k - 1) + ^(j'-fc)e,-. 

The proof is similar to the proof of Lemma [5] By applying this to A 2 (C(a)) = 
— 2R(a) and noting that C„_i(a) = (— 1) 7+1 and C n (a) = it follows as in the 
proof of Corollary 3] that for k = 0, 1, • • • ,n 

n-i 

(3.6) C fe (a) = (-l) 7+1 (n - fc) - 2 £ (j - fc)i?,, 

j=k+i 
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Similarly, from efc = bk+2 — 2bk+i + bk it easily follows that Y^JLi 2 e 3 = bi — 
&2 + b m — b m -\. Applying this to A 2 (C(a)) = — 2R(a) gives us that 2j2k=i ^* = 
C (a) - Ci(a) + C„(a) - C„_ x (a) = 1 - 2 7 + (-1P+ 1 and hence 



n-l 

(3-7) 



—7 if 7 even 
1 — 7 if 7 odd. 



4. Applying the Results 

In this section we will first present an example where the run vector R(a) is 
used in order to establish a relationship between the correlations of certain related 
sequences. Next we show how the run vector R(a) can be efficiently calculated 
based on a simple algorithm. After that we develop an alternative formula which 
shows how Rk can be expressed in terms of the sums r± + Ti + • • • + Tj . The rest 
of this section discusses skew-symmetric sequences and their run length encoding. 
For a skew-symmetric sequence a we have Cfc(a) = whenever k is odd; if k > is 
even, then we will show that Cfe(a) = i?fe(a). Moreover, we give a characterization 
of skew-symmetric sequences in terms of their run length encoding. 

4.1. A First Example. For to > 1 we consider the sequence b = (61, 62, • • • , b nm ) 
obtained by repeating each element of a exactly to times so that the run length 
encoding r(b) of b is given by (mri,mr2, • • • , mr 7 ). For example to = 2 gives us 
b = (01, ai, 02, CI2, • • ■ , a„, a n ). We will show that for < k < n and < s < to the 
aperiodic autocorrelations of b are given by 

(4.1) C km+S (b) = (to - s)C k (a) + sC k+1 {a) 

where we have put as before C n (a) := 0. 

For < k < n and < s < to put Dk m +s ■— (m — s)Ck(a) + sCk+i{a), D nm = 
and D := (Do, D\, ■ ■ ■ , D nm ). In order to prove (|4.ip using Lemma[5]and Corollary 
[3] it is sufficient to show that 

C (b) = D , Ci(6) = L»i and A 2 (L>) = -2R(b). 

We have C Q (b) = mn = mC (a) = D Q and by (|3Jll Ci(6) = mn + 1 - 27 = 
(to— l)n+n+l— 27 = (to— l)Co(a)+Ci(a) = D\. Since r(6) = (mri, mr2, • • ■ ,mr 7 ) 
it is not difficult to see that for l<j< nm 

R± (a) if j = mod to 
otherwise. 



i?,(6) 



Now let A 2 (L>) = (ei,e 2 , ••• ,£™-i)- For < k < n we have D km+S+1 - 
D km+S = Ck+\{a) — Cfe(a) for < s < m — 1 and also for s = m — 1. Since 
A 2 (D) := A(A(D)) we have €j = unless j is a multiple of to; in this case 
we have ti. m = C+i(a) — 2Cj(a) + C_i(a) = — 2Ri(a) = —2Ri. m (b). Hence 
A 2 (L>) = -2R(b). 

4.2. Calculation of the Run Vector. The results of the previous section show 
how the run structure and the aperiodic autocorrelations are related. The main 
result which says that A 2 (C(a)) = —2R(a) can be used in order to calculate the 
autocorrelation vector C(a). However, the presented form, in particular (|3.2| . is not 
very well suited for practical purposes. However, R(a) can be efficiently computed 
based on the following simple algorithm, which can be easily derived from (|3.2| . 
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As a precondition we will assume that the run length encoding r and its length 
7 are known and that each component of the array R is initialized to zero. After 
the final step of the algorithm the array R contains the computed run vector R(a). 
The algorithm computes R(a) in two steps. The first step considers only the outer 
substrings of the run length encoding r of a and uses the fact that |r(l, j)| = 
7 - k(j,7+ !)l : 

a < 1 

s <- 

for j = 1 to 7 — 1 do 

S <— S + Tj 

R s <- i? s + a 

i?„-s «- i?„-s + 7 • a 

a i a 

end for 

The second and final step of the algorithm takes into account all inner substrings 
of r: 

for i — 2 to 7 — 1 do 

a i 2 

s <- 

for j = i to 7 — 1 do 
s <— s + r j 
i? s ^- R s + a 

a < a 

end for 
end for 

In the first step of the algorithm there are 7 — 1 iterations and we have a total of 
(7-1H7-2) jt era ti ons in the second step. In the second step by enrolling the inner 
loop into two separate loops any multiplication can be avoided; the same is true 
for the loop in the first step. Thus the computation of R(a) requires 4(7 — 1)+ 
(7 — 1) (7 — 2) = (7 — 1)(7 + 2) additions and no multiplications. 

Let us compare this with the direct calculation of the autocorrelations C\ (a), Ci (a), 
• • • , C„_i(a) as defined in which requires multiplications and t ^ n ~ 1 )^ n ~ 2 ) 

additions. Now, at this point let us assume that 7 ~ ^; we will come back to this 
assumption below. If 7 w § then the above algorithm for the computations of 

R(a) requires approximately \- additions and no multiplication. Even if we do not 
distinguish between the cost of an addition and a multiplication, then the above 
algorithms should be approximately four times faster than the direct calculation 
using (jl.ll) ; a very similar result can be found in [5] . 

At least for large n this remains true, even if we consider the additional cost in 
order to compute C(a) and the run length encoding r of a: if — 2R(a) is already 
computed then we need less than 2n additions in order to calculate the autocorre- 
lation vector C(a) using A 2 (C(a)) = — 2R(a) and not more than n + 7 additions 
are necessary in order to calculate r. 

Let us come back to the assumption that 7 « ^. Note that this is true for 
most binary sequences if n is large. If, however, 7 > then consider the binary 
sequence a of length n one gets, when inverting every second element of a, i.e. 
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Hi := en if i is odd and en :— —a,i if i is even. Then Ckifl) = Ckifl) if k is even 
and Cfe(a) = — Cfc(a) if fc is odd. By induction it is not difficult to show that 
7 + 7 = n + 1 where 7 denotes the length of the run length encoding of a. Thus if 
7 > , then we have 7 < § and a has up to every second sign the same aperiodic 
autocorrelation vector. So in the above algorithm a instead of a can be used in this 
case in order to compute the autocorrelation vector C (a) . 

4.3. A More Practical Formula for Rk- In this subsection we develop a formula 
which shows how Rk can be expressed in terms of the sums r± + r 2 + • • • + r j ; 
this gives rise to a different algorithm for calculating Rk which is for example 
particularly useful in a branch-and-bound like exhaustive search where only parts 
of the sequence a are known. 
For j = 1,2,..- ,7 let 

(4.2) Sj := n + r 2 + ■ ■ ■ + rj 
and 

(4.3) tj := r 1 + r 7 _i H h r 7 _,-+i. 

Note that then 1 < si < s% < ■ ■ ■ < = n, 1 < t\ < t<i < • • • < ty = n and that 
farj = l,2,..., 7 -l 

(4.4) Sj + try-j = n. 

Furthermore, let 

(4.5) S := {si,s 2 , ••• ,s 7 _i} 
and 

(4.6) T:={h,t 2 ,--- ,* 7 -i}- 
The functions fs, It ■ Z— >{— 1,0,1} defined by 



(4-7) fs(k) := 



(-iy if fe e 5 with fc = Sj 

otherwise 

'(-l) j if fc G T with fe = tj 

otherwise 



(4-8) f T (k) 

will play an important role in the following. Note that by (|4.4| we have for all fcG Z 
(4.9) f s (k) = (-iyf T (n-k). 

The next theorem shows how Rk can be expressed in terms of Si, S2, • • • , s 7 _i. 
Theorem 6. Let fc = 1, 2, ■ ■ • , n — 1; then 

7-1 

Rk = fs(k) + (-iy<fs(n -k) + 2J2(-I) j fs(sj - k) 

i=i 

Proof. As in the proof of Theorem Q] let S := (§1,62,- ■■ ,S n +i) be the sequence 
defined by Si := a% — aj-i for all % = 1, 2, • • • ,n + 1. Then 5i =01, <5 n +i = (— l) 7 ai 
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and for i = 1, 2, • • • , n — 1 we have 2fs(i) — ai<5j+i. Let 1 < k < n. Hence by 
Theorem [1] and Lemma [2] 

_^ _^ n+l— 



2 

i=l 



j n — 

^(<5i<5fc+i + 5 n+ i-k5 n+ i + 2J ) 



i=2 
n— fc— 1 



= / s (fe) + (-l)7/ s („_ fc ) + 2 ^ /s(i)/s(» + *). 

Since /s(«) = for i > n, Theorem[5]follows directly from the following Lemma. □ 

Lemma 7. Let k = 1, 2, • • • , n — 1; then 

7— l 7— 1 

j2(-i) j -fs(sj-k) = j2(-iy-fs(s 3 +k) 

3 = 1 3=1 

n-1 



Proof. We have 

7— 1 7— 1 

3=1 j=i 

n-1 n— 1 

E /s(0 • fs(i ~k) = J2 fs(i + ■ fs(i) = 

7 — 1 7— 1 

E + fc ) ■ ^(*,o = E M*j + fc ) ■ 



□ 



can be expressed by Theorem [5] as the sum of 7 + 1 terms. Each of these 
terms can be easily computed; for example by means of a pre-calculated array which 
holds the values fs{i) for i — 1, 2, • • • n — 1. As before we may assume that 7 ~ § 
and that n is large. But then, on average (taken over all binary sequences of length 
n and all 1 < k < n) only about a quarter of these terms are not zero, since on 
average we have |{1 < j < 7 : Sj — k > and /s(sj — k) ^ 0}| ~ 

If only the first and the last part of the binary sequence a are known, then we 
will see that a reformulation of Theorem [5] is useful. 

Corollary 8. Let k = 1, 2, • ■ • , n — 1; then 

7-1 

(-1)^-* = fs(k) + f T (k) + 2E(-l) J Jr(fc - Si)- 

Proof. The result follows directly from Theorem [51 Lemma [7] and (14.91) . □ 

Now let 1 < k < m < n and let us assume that the first and the last m elements 
of a are known. Then all Sj £ S with Sj < m and all tj £ T with tj < m can 
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= {4} 


Zio 


= {5} 


To 


= {4} 



be easily determined; the same is true for fs(i) and /t(z) for all i < m. Hence 
Corollary [8] can be applied in order to calculate Rn-k- In particular, this can be 
applied in a branch- and-bound like exhaustive search (as for example in [TO]) in 
order to find binary sequences with low autocorrelations. 

As an example let r = (5, 2, 2, 1, 2, • • • , 5, 3, 1, 4) and assume that only the first 
12 and the last 12 elements of a are known; let us assume further that 7 is even. 
Thus we know that (si, S2, S3, S4) = (5,7,9,10), (ti,t 2 ,t 3 ) = (4,5,8), S5 > 12 and 
<4 > 12. Thus fs(j) has for j = 1, 2, • • • ,11 the following values: 

0000 -1 010 -1 10 

and /t(j) has for j = 1, 2, • ■ • , 11 the following values: 

000 -1 100 -1 000 

For k < 12 let S k := {k - s > : s € 5} and T k := 5 fe n T then 

Sn = {6,4,2,1} 
S10 = {5,3,1} 
5 9 = {4,2} 
= {3,1} 
= {2} 
S e = {1} 

S k = for k < 6 and T k = for fc < 9. Thus for g(j) := f s (j) + f T (j) we 
have 5(4) = 5(8) = g(9) = — 1 and g(7) = g(10) = 1; all other values of g(j) for 
j < 12 are zero. Note that for 1 < j < 7 we have fr{k — Sj) 5^ if and only if 

k — Sj G T fe ; hence for i^" } fe := 2 Y?jZl{-l) 3 h{k - sj) it follows that i?^") = 

for i = 1,2,- •• ,8 and that = -2/ T (4) = 2, i?i m) 10 = -2/ T (5) = -2 and 

fi^j = 2/ T (4) = -2. Therefore, i?„_ fc = .g(fc) + E^"]. has by Corollary [5] for 
k = 1, 2, • • • ,11 the following values: 

-1 1 -1 1 -1 -2. 

4.4. Skew-Symmetric Sequences and Run Length Encoding. An odd length 
binary sequence a of length 2m — 1 is called skew- symmetric if for i = l,2,---,m — 1 

(4.10) a m -i = (-l)*a m+i . 

Skew-symmetric sequences are of particular interest in different areas. For ex- 
ample, consider the merit factor F(a) which is defined by 

(4.11) F(a) := . 

In many applications it is of interest to collectively minimize the absolute values of 
the autocorrelations; the merit factor can be used as a possible measure, see [lj[2l|3] 
for a survey on this topic. Let F n be the highest merit factor possible for all binary 
sequences of length n. We say that a binary sequence a has an optimal merit factor 
if F(a) — F n (where as always n denotes the length of a). Many of the known 
odd length binary sequences with an optimal merit factor are skew-symmetric and 
it is even conjectured in |11) that a restriction to skew-symmetric sequences does 
not change the asymptotic behavior of F n . Furthermore, all odd length Barker 
sequences are skew-symmetric; a Barker sequence of length n is a binary sequence 
with |C7fc(a)| < 1 for all k = 1, 2, • • ■ , n — 1. Since for a binary sequence a we have 
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Cfc(a) 4- C n ~k{a) = 4 mod n for fc = 1, 2, • • • , n — 1, a Barker sequence has an 
optimal merit factor. 

For a skew-symmetric sequence a it is not difficult to see that Cfe(a) = if fc 
is odd. By (|3.ip it follows that 7 = = m. Furthermore, by Theorem Q] we 
have C fe+ i(a) - 2C fe (a) + C fe _i(a) = -2R k for fc = 1, 2, • • • ,n - 1. Therefore, 
Rk(a) = Cfe(a) if fc > is even and Rk{a) = — |(C/-_i(a) + Cfe + i(a)) if fc is odd. 
In particular, for an odd length Barker sequence a we have Rk(a) — (— l) fe +7+i for 
fc = 2, 3, • • • , n — 1 and Ri(a) = —7 if 7 is odd and Ri(a) — 1 — 7 if 7 is even. 

Next we want to describe skew-symmetric sequences in terms of their run length 
encoding. The run length encoding r of a is called skew- symmetric if a itself is 
skew-symmetric. Note that this is well defined. We call the run length encoding r 
balanced if 

(4.12) S U T = {1, 2, ■ • • , n - 1} and S (IT = 0. 

Note that if r is balanced, then n = 27 — 1. Furthermore, r is balanced if and only if 
we have for each k — 1,2, ■ ■ ■ ,n — 1 either k £ S or n — keS. Hence if r is balanced, 
then by flH]) / s (fc) ^ ^ f T (k) = <=> f s (n - k) = for k = 1, 2, • • • , n - 1. 

We call a run length encoding r = (r*i, 7-2, • • • , r 7 ) reducible if either ri = 1 and 
r 7 > 1 or ri > 1 and r 7 = 1. If r is reducible, then the reduced run length encoding 
f is given by f := (r%, Ts, ■ ■ ■ , r 7 — 1) if r\ = 1 and by f := (ri — 1, ra } ■ • • , r^_i) if 
ri > 1. Given r we will in the following always denote the so defined reduced run 
length encoding by f. Let C denote the set of all run length encodings of binary 
sequences of length n > 1. Furthermore, let C s be the set of all r £ C which are 
skew-symmetric and let Cb be the set of all r G C which are balanced. Now let r be 
a run length encoding with 7 > 1; then it quite easily follows that 

(4.13) r G C s r is reducible and f G C s 

(4.14) r G Cb r is reducible and f G Cb- 

We will show that C s = Cb, i.e. that a binary sequence is skew-symmetric if 
and only if its run length encoding is balanced. In particular, we will inductively 
construct a set I such that C s = I and Cb = Z. Let 1^ := {(1)}, i.e. 1^ is the 
one-element set containing the run length encoding of the two binary sequences of 
length 1. For fc > 1 let l( fc+1 ) be the set of all reducible r G C with f G X' fe - ) and 
finally let 1 := {J X {k \ It is easy to see that the elements of X can be interpreted 
k>\ 

as the nodes of an infinite binary tree: r = (1) is the root, and if r G I with 7 > 1, 
then r is its parent. Moreover, 1^ presents the set of nodes having depth fc — 1. 
If r G X then it easily follows by induction that r G I^'. Furthermore, we have for 
7 > 1 that r G X if and only if f G I. 

The next proposition shows that the skew-symmetric sequences are exactly the 
binary sequences which have a balanced run length encoding. 

Proposition 9. C s = Cb- 

Proof. First we will show that C s C X, Assume that this is not the case and choose 
r G C s with minimal 7 such that r ^ X. Then 7 > 1 and by (|4.13t r is reducible 
and f G C s . Since r^lwe also have f ^ X which contradicts the minimality of 7. 
Next we want to show that IC£ S . Similar as before, assume that this is not the 
case and choose r G X with minimal 7 such that r ^ C s . Then 7 > 1, r is reducible 
and f G X. By (|4. 13|) f is not skew-symmetric which contradicts the minimality 
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of 7. Therefore, we have C s = X, Using (|4.14[) similar arguments show that also 
C s = I which completes the proof. □ 

5. The Periodic Case 

The situation in the periodic case is quite similar to the aperiodic one. In order to 
formulate and prove the relationship between the periodic autocorrelations of binary 
sequences and their run structure we have to adapt some of the previous definitions 
to the periodic case. In the following we will always assume that a is a binary 
sequence of length n which is not constant, i.e. Oi ^ Oj for some 1 < i < j < n. 
This is no loss of generality as we will see at the end of this section. We could 
prove the following results in a very similar way as we have done in the aperiodic 
case. However, by using (|1.3[) we can directly transfer the aperiodic results of the 
previous section to the periodic case. 

In the following we will in addition always assume that a\ ^ «„. Again this is 
no loss of generality since the periodic autocorrelations are shift-invariant, i.e. for 
b := (a„, 01, 0,2, • • • , we have Ck(b) = Ck(a) for all k = 0, 1, • • • , n. The main 

reason for the additional assumption a\ ^= a n is that we can in this case adopt 
the definition of the the run length encoding r — (r±, r2, ■ ■ • , Tj) of a without any 
modification. Note that 7 is always even if a\ ^ a n . 

5.1. Preliminaries for the Periodic Case. As in the periodic case substrings 
of the run length encoding r — (ri, T2, ■ ■ • ,r 7 ) of a play an important role. For 
1 < i < 7 we will in the following always put r 1+ i := Tj. Similar to the concept 
of half-open interval on a circle we have to adapt the definition of a substring to 
the periodic case. In order to distinguish this new definition from the previous one 
we will refer to p-substrings. As we will see, p-substrings r(i,j) of r will be only 
defined if and only if 1 < i,j < 7 and i ^ j. 

For 1 < i < j < 7 the p-substring r(i,j) is the same as in the periodic case; 
thus representing (ri,fi+i, ■ • • ,rj_i). For 1 < j ' < i < 7 the p-substring r{i,j) 
will represent the non-empty contiguous part (ri,rj+i, • • • ,r 7+ j_i) of the periodic 
extension of r. Unless otherwise stated we will in the following always assume that 
1 < i, J < 7 an d i 7^ j. As before \r(i,j)\ denotes the length of the p-substring, 
hence 

1 / - -\ jj-i if 1 < i < j < 7 

I 7 + J — 1 iil<J<*<7- 

In particular we have 1 < \r(i, j)\ < 7. Note that the p-substrings r(i,j) and 
r(j, i) are complementary: they have no element in common and their concatenation 
represents all elements of r; in particular we have 

(5.1) \r(i,j)\ + \r(j,i)\ = 7. 

5.2. The Main Result for the Periodic Case. The periodic run structure TZ of 
a will be defined as the set of all p-substrings of r. Similar to the periodic case we 
will analyze the connection between the periodic autocorrelations Ck (a) of a binary 
sequence a of length n and their periodic run structure given by the run length 
encoding r = (n, r^, • • • , r 7 ) of a. The case k = 1 is easy; similar to the aperiodic 
case we have 



(5.2) 



Ci(a) =n-2 7 
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since for each j = 1, 2, • • • ,7 the jth run of a contributes rj — 2 to the sum C\(a). 
Thus (51(a) = Y,]=i(rj - 2) = n - 2 7 . 

The sum S(r(i,j)) for a p-substring r(i,j) will be defined as 

-1 



S(r(i,j)) 




if f < i < j < 7 
if 1 < j < i < 7. 



Let 1 < k < n; we will denote by 7?.fc the set of all p-substrings r(i,j) of r with 
S(r(i,j)) = k (where as always 1 < i,j < 7 and i 7^ j) and put 

(5.3) R k := (-!) H - 

Similar as in the aperiodic case a p-substring of r corresponds in a one-to-one way 
to consecutive runs of a. Note however, that consecutive has in the periodic case 
a slightly different meaning due to the periodic extension of a. Similar to the 
aperiodic case Rk can be interpreted as the sum of weights of those consecutive 
runs of a which (total) length equals fc; the weight is in the periodic case either 1 
or -1 depending on whether the number of runs is even or odd. In particular, —R\ 
equals the number of runs of length 1. 

Since S(r(i,j))+S(r(J,i)) — n, we have r(i, j) Git^ifand only iir(j,i) € R n -k- 
In particular, this shows that \R-k\ — \R-n-k\- Moreover, since 7 is even, it follows 
from (pTT]) that 

(5.4) (_i)K*>i)l = (_i)l r O'. i )l 

and hence R^ = R n -k- 

Similar to the periodic case we put R(a) := (i?i, R2, • • • , R n -i)- Let us compute 
R(a) for the following two sequences of the previous examples. For a = (+ + + + 

+ H ) we have n = 13, 7 = 2 and r = (6, 7). Furthermore, we have 

TZk = for 1 < k < 12 unless k = 6 or k — 7; in these cases we have IZq = {r(l, 2)} 
and n 7 = {r(2,l)}. Hence R(a) = (0,0,0,0,0, -1, -1,0,0,0,0,0) by flS^. For 

a=(+ + H h + H ) we have n — 15, 7 = 4 and r = (3, 6, 3, 3). 

In order to calculate for instance Rq we have by (|5.3|) to consider all consecu- 
tive runs of a which have a total length of 6. In this case these consecutive runs 
correspond to the p-substrings r(2,3), r(3, 1) and r(4, 2); the p-substrings r(2,3) 
consists of a single run, whereas the p-substrings r(3, 1) and r(4, 2) each consists of 
two consecutive runs. This shows that IZ5 — {r(2, 3), r(3, 1), r(4, 2)} and Rq = 1. 
Similarly we have K 3 = {r(l, 2), r(3, 4), r(4, 1)}, K 9 = {r(l, 3), r(2, 4), r(3, 2)}, 
7Zi2 — {r(l, 4), r(2, 1), r(4, 3)} and TZk = if fc is not a multiple of 3; hence 
R(a) = (0, 0, -3, 0, 0, 1, 0, 0, 1, 0, 0, -3, 0, 0). 

Lemma 10. 2Rk = Rk + Rn-k for k = 1, 2, • • • , n — 1. 

Proof. Let 1 < k < n. We can write TZk as the (disjoint) union TZk — TZ^ UTZ^ 11 ^ 
where TZ^ n ^ denotes the set of all inner substrings in TZk and TZ ^ ut ^ denotes the set 
of all outer substrings in TZk- Similarly we can write TZk as the (disjoint) union 

n k = nf l) U TZ[ out) u £<f™ d ) 

where TZ k m) := {r(i,j) e TZ k : 1 < t < j < 7}, := {r(t,j) £K t :i = 

1 or j = 1} and TZ k amd) := {r(i,j) G : 1< j < i < 7}. 
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If u G TZ^ n ^ then u is an inner substring; moreover, we have u £ W£ if and 
only if u G TZ^^ and thus 



2 E (-i) N = E «(«)-(-i) ,u| - 



If r(i, j) € -j^flmid) r ^ ^ j g an mner substring; furthermore we have r(i, j) G 

2 E (-i) H = E «(«)-(-i) |u| . 

For r(i,j) G K[™''we consider the two possible cases i = 1 and j = 1 separately: 
r(l, j) G 7^ ut) ^ r(l, j) G ^ out) «. r(j )7 + 1) G ^ 

r(i, 1) G 6<° ut) r(*, 7 + 1) G ft< out) r(l,t) G ■ 
Hence by (|5l| 

E (-1) H = E a(u)-(-l)H= £ 
Summing up we obtain 

2 E - E «(«)-(-i) M + E «(«)-(-i) |u| - 

□ 

Since 7 is even it easily follows from (|3.7p and Lemma ITD1 that Y^k=i Rk — ~1- 
The next theorem (cf. [TJ) which is the main result of this section shows that for a 
binary sequence a the periodic autocorrelations and R(a) are closely related. 

Theorem 11. Let k = 1, 2, • • • , n — 1; t/ien 

<5fc+i(a) - 2C fe (a) + C fc _i(o) = -4fl fc . 

Proof. Let 1 < k < n. By Theorem[T]we have Cfc+i(a) - 2Cfc(a) + Cfe_i(a) = -2i? fe 
and C( n _fc)_|_i(a) — 2C n -k(a) + CV n _fc)_i(a) = — 2R n -k- Hence the theorem follows 



by (L3J} and Lemma [TUJ □ 

By (|5.2p we have C'i(a) = n — 27 and Theorem HT1 gives us that 62(0) = 2Ci(a) — 
n — 4i?i and therefore C2 (a) = n — 4j — 4i?i . 

Let us denote by C(a) := (Cq (a) , C\ (a) , ■ • • ,C n (a)) the 'periodic autocorrelation 
vector of a. 

Corollary 12. A 2 (C(a)) = -4R(a) /or n > 3. 

Proof. This is just a reformulation of Theorem 1111 □ 
Corollary 13. Let k = 0, 1, • • ■ , n; i/ien 

fc-i 

C i (a)=n-2 7 fc-4^(/ ! -j)4 
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Proof. Since Co (a) = n and by (|5.2|) C\(a) = n — 2j the statement is true for 
k = 0, 1. For k > 2 it follows directly from A 2 ((7(a)) = 2R(a) and Lemma M □ 

Finally, as in the aperiodic case a complementary formulation for Ck (a) in Corol- 
lary [T3] can be easily derived; since C n (a) — n and C„_i(a) = Ci(a) = n — 27 
equation (|3.5p gives us that for fc = 0, 1, • • • , n 

n-i 

C* fe (a) = n - 2 7 (n - fc) - 4 ^ (j - 

j=fc+l 

Remark. In this section we have for technical reasons assumed that the binary 
sequence a is not constant. This however is no loss of generality since Theorem 
1111 and the following results are also true if a is a constant binary sequence. This 
follows immediately since for a constant binary sequence a of length n we have 
Ck(a) = n and also TZk = and thus Rk = for k = 1, 2, ■ • • , n — 1. 
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